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In classical evolutionary theory, genetic variation provides the source of heritable phenotypic vari- 
ation on which natural selection acts. Against this classical view, several theories have emphasized 
that developmental variability and learning enhance nonheritable phenotypic variation, which in 
turn can accelerate evolutionary response. In this paper, I show how developmental variability al- 
ters evolutionary dynamics by smoothing the landscape that relates genotype to fitness. In a fitness 
landscape with multiple peaks and valleys, developmental variability can smooth the landscape to 
provide a directly increasing path of fitness to the highest peak. Developmental variability also 
allows initial survival of a genotype in response to novel or extreme environmental challenge, pro- 
viding an opportunity for subsequent adaptation. This initial survival advantage arises from the 
way in which developmental variability smooths and broadens the fitness landscape. Ultimately, the 
synergism between developmental processes and genetic variation sets evolutionary ratj^ 



In evolutionary biology, environmentally induced 
modifications come under unfinished business . . . 
There have been repeated assertions of both their 
importance and their triviality, a lot of discussion 
with no consensus. . . . Yet the debate has contin- 
ued over such concepts as genetic assimilation, 
the Baldwin effect, organic selection, morphoses, 
and somatic modifications. So much controversy 
over the span of a century suggests that a prob- 
lem of major significance remains unsolved |ll 
p. 498]. 

I. INTRODUCTION 

A single genotype produces different phenotypes. De- 
velopmental programs match the phenotype to differ- 
ent environments. Intrinsic developmental fluctuations 
spread the distribution of phenotypes. Extrinsic en- 
vironmental fluctuations perturb developmental trajec- 
tory. These nonheritable types of phenotypic variation 
are common. 

Nonheritable phenotypic variation is not transmitted 
through time. Thus, nonheritable variation would seem 
to be irrelevant for evolutionary change, which instead 
depends on the genetic component of variation. However, 
nonheritable phenotypic variation can, in principle, affect 
evolutionary rate. At first glance, that contribution of 
nonheritable phenotypic variation to evolutionary rate 
appears to be a paradox. 

Many different theories, commentaries, and controver- 
sies turn on this paradox (Box[2|. The literature has 
followed a consistent pattern. Detailed theories relate de- 
velopmental variability to accelerated evolution. Coun- 
terarguments ensue. Listings of complicated examples 



claim to support the theory. Refinements to the theory 
develop. 

In the end, few compelling examples relate nonheri- 
table phenotypic variability to evolutionary rate. The 
literature is hard to read. Enthusiasts extend the con- 
cepts and keep the problem alive. Through the enthusi- 
asts' promotions, many have heard of the theory. But, 
in practice, few consider the role of nonheritable phe- 
notypic variability in their own analyses of evolutionary 
rate. Almost everyone ignores the problem. 

In this article, I emphasize simple theory that re- 
lates nonheritable phenotypic variability to evolutionary 
rate. Understanding the paradoxical relation between 
nonheritable phenotypic variability and evolutionary rate 
is an essential step in reasoning about many evolutionary 
problems. 

This article is primarily a concise tutorial to the ba- 
sic concepts (see Box[T]). I briefly mention some of the 
history (Box |2| and recent, more advanced literature 
(Boxg. 



II. SMOOTHING THE EVOLUTIONARY PATH 

The distribution of phenotypes for a given genotype 
is called the reaction norm. All theories come down to 
the fact that a broad reaction norm smooths the path of 
increasing fltness. Once one grasps the smoothing pro- 
cess, many apparently different theories become easy to 
understand. 

The next section gives the mathematical expression for 
the smoothing of fltness by the reaction norm. Fig. [l] 
explains the mathematics with a simple example. 



A. The reaction norm smooths fitness 
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Box 1. Topics in the theory of natural selection 



This article is part of a series on natural selection. Al- 
though the theory of natural selection is simple, it remains 
endlessly contentious and difficult to apply My goal is to 
make more accessible the concepts that are so important, yet 
either mostly unknown or widely misunderstood. I write in 
a nontechnical style, showing the key equations and results 
rather than providing full derivations or discussions of math- 
ematical problems. Boxes list technical issues and brief sum- 
maries of the literature. 



Second, the phenotype expressed varies according to 
the reaction norm. Read p{x\x) as the probability of ex- 
pressing the phenotype x given a genotype with average 
phenotype x. 

Third, we must calculate F{x)^ the expected fitness 
for a genotype with average phenotype x. We obtain 
the expected fitness by summing up the probability, p, of 
expressing each phenotype multiplied by the fitness, /, 
of each phenotype. That sum is 

F{x) = Y,p{x\x)f{x), (1) 

taken over all the different phenotypes, x. We often mea- 
sure X as a continuous variable. The sum is then equiv- 
alently written as 

F{x) = J p{x\x)f{x)dx. (2) 

This equation shows how one averages the fitness, /(x), 
for each phenotypic value, x, over the reaction norm, 
p{x\x), to obtain the expected fitness of a genotype, F{x). 
We label each genotype by its average phenotype, x. The 
expected fitness of a genotype, F{x)^ is what matters for 
evolutionary process [2 . 

The averaging of expected fitness over the reaction 
norm is the key to the entire subject. Averaging over the 
reaction norm, p, flattens and smooths the fitness func- 
tion, /. This smoothing makes the curve for expected 
fitness, F, have lower peaks and shallower valleys than 
the original fitness curve, /. The smoothing of F changes 
evolutionary dynamics. The whole problem comes down 
to understanding how reaction norms smooth fitness, and 
the consequences of a smoother relation between geno- 
type and fitness. 



B. Example of continuous smoothing 

Fig. [l] shows an example of smoothing with discrete 
distributions. It will often be convenient to consider 
smoothing of continuous variables. Fig. |2] shows an ex- 
ample. The following expressions describe the underlying 
mathematics. 



Box 2. Historical overview 



Schlichting and Pigliucci [3] and West-Eberhard fT] thor- 
oughly review the subject. Here, I highlight a few key points 
in relation to this article. I treat learning and developmental 
plasticity as roughly the same with regard to potential con- 
sequences for evolutionary rate, although one could certainly 
choose to focus on meaningful distinctions. 

In my own reading during the 1980s, I had found the re- 
lation between learning and evolutionary rate intriguing but 
confusing. Baldwin's [4] idea that learning can accelerate evo- 
lutionary rate seemed attractive. Mayr 5 , in his monumen- 
tal review of biological thought, also discussed various ways 
in which behavior or flexible developmental programs might 
alter evolutionary dynamics. Those ideas seemed potentially 
important, but it was not easy to grasp the essence. The 
literature at that time was not helpful, with a lot of jargon 
and sometimes almost mystical commentary mixed in with 
intriguing and creative ideas. 

It was clear that learning could slow evolutionary rate. 
Different genotypes could, through learning, end with the 
same phenotype. Reducing the phenotypic distinction be- 
tween different genotypes would generally slow evolutionary 
rate. The more intriguing problem concerns the origin of evo- 
lutionary novelty or the response to novel or extreme environ- 
mental challenge. Environmental novelty and acceleration of 
evolutionary response were the primary concern of Baldwin 
[4 , Waddington [6l |7], and West-Eberhard [T]. My article 
also focuses on acceleration of evolutionary response. 

Hinton and Nowlan |8] clarified the subject with their sim- 
ple conclusion that: 

Learning alters the shape of the search space 
in which evolution operates and thereby pro- 
vides good evolutionary paths towards sets of co- 
adapted alleles. We demonstrate that this effect 
allows learning organisms to evolve much faster 
than their nonlearning equivalents, even though 
the characteristics acquired by the phenotype are 
not communicated to the genotype. 

During the past few decades, the fundamental role of 
smoothed fitness surfaces in biology has not always been rec- 
ognized as fully as it should be, in spite of several fine pa- 
pers along that line (see Box|3|. Interestingly, certain com- 
puter optimization algorithms take advantage of the increased 
search speed provided by a process similar to smoothed fitness 
landscapes [9l 110]. 



In Fig. [2j the reaction norm follows a normal distribu- 
tion. In symbols, we write 

P{X\X) r^N{x,-f'^), 

which we read as the probability, p, of a phenotype, x, 
for a reaction norm centered at x, follows a normal dis- 
tribution with mean x and variance 7^. 
For fitness, we write in symbols 

which we read as the fitness, /, of a phenotype, has the 
shape of a normal distribution with mean and variance 
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FIG. 1. The reaction norm smooths the fitness landscape. This simple example illustrates the calculation of the expected 
fitness for each genotype, following Eq. ([T]). (a) The calculation of expected fitness, F{x), for the smallest average phenotype, 
X — —3. For that average phenotype, the reaction norm, shows the probabilities of expressing different phenotypes, 

X. In this case, the peak of the reaction norm matches the average value, and each phenotype ±1 occurs half as often as the 
peak value. To get the expected fitness for a reaction norm centered at x = —3, one sums up the probability p{x\x) for each 
phenotype, x, multiplied by the fitness for each phenotype, /(x). The arrows illustrate the summation, (b) The expected 
fitness, F(x), for each increase in x, is calculated by the same summation process, shifting the reaction norm to the right by 
one to get the proper value for each x. (c) The full transformation is shown between the fitness for each phenotypic value, /(x), 
and the expected fitness, F(x), for each genotype with reaction norm p(x|x) and average phenotype x. The reaction norm 
smooths the multipeaked fitness function, /(x), into the single-peaked fitness function F[x). Evolutionary dynamics depend 
on genotypic fitnesses, F. Thus, the reaction norm transforms fitness into a smooth function that allows a direct increasing 
path to the fitness peak from any starting value for average phenotype. 



cr^. In this case, we assume the center of the fitness 
distribution is at a phenotypic value of zero to give a 
fixed point for comparison — any value to center fitness 
could be used. The important issue is that fitness falls 
off from its peak by the pattern of a normal distribution. 
The width of the fitness function is set by the variance 
parameter, a^. 

We can now use Eq. ([2| to calculate the expected fit- 
ness of a genotype with average phenotype yielding 

F(x)^A/'(0,7'+a2). (3) 

This equation shows that smoothing by the reaction 
norm, p, flattens and widens the shape of the fitness func- 
tion by increasing the variance of the expression for F. 



C. Evolutionary response to novel or extreme 
challenge 

If a genotype expresses an average phenotype close to 
the maximum fitness, then a narrow reaction norm has 
higher fitness than a broad reaction norm. The lower 
plots of F{x) in Fig. |2] illustrate contrasting widths of 
reaction norms. Near the peak, the average phenotype 
closely matches the optimum, and the narrower reaction 
norm has higher fitness. This advantage occurs because a 



narrow reaction norm expresses fewer phenotypes in the 
tails, away from the optimum. 

For genotypes with an average phenotype far from the 
maximum fitness, a broad reaction norm has higher fit- 
ness than a narrow reaction norm. Fig. |3] illustrates this 
advantage for broad reaction norms. In that figure, both 
reaction norms are centered at x. Only those phenotypes 
above the fitness truncation point survive. The broad re- 
action norm produces some individuals with phenotypes 
above the truncation point, whereas the narrow reaction 
norm has zero fitness. 

If the environment poses a novel or extreme challenge, 
the broad reaction norm wins. By contrast, in a stable 
environment for which the current average phenotype is 
close to the fitness optimum, the narrow reaction norm 
wins. Thus, extreme or novel environmental challenges 
or intense competition favor a broad reaction norm. 

Haldane [11 made a similar point when he said: "In- 
tense competition favors variable response to the envi- 
ronment rather than high average response. Were this 
not so, I expect that the world would be much duller 
than is actually the case." Holland's [12] emphasis on 
exploration versus exploitation is perhaps closer to the 
problem here. Broad reaction norms are favored when 
exploration of novel challenges dominates, whereas nar- 
row reaction norms are favored when exploitation domi- 
nates. Fluctuating environments may also favor a broad 
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FIG. 2. Reaction norms and fitness for continuous pfieno- 
types. Eacfi column sfiows fiow tfie reaction norm, p{x\x), 
smootfis tfie fitness function, /(x), to give tfie expected fit- 
ness, F{x), for a genotype witfi average pfienotype x. The 
smootfiing follows Eq. (121. These examples use normal distri- 
butions that lead to Eq.Tsl. (a) The solid and dashed reaction 
norms follow A/'(x, 1/2) and A/'(x,5), respectively. Fitness, 
/(x), has the shape of a normal distribution with vanishingly 
small variance, A/'(0,cr^ 0). Thus, expected fitness, 
is the same as the reaction norm, (b) The same structure 
as in (a), except that f{x) is much wider, following A/'(0,7). 
Thus, F(x) now has curves A/'(0, 7.5) and A/'(0, 12) for solid 
and dashed curves, respectively. In each plot, the baseline is 
set to 4.3% of the peak in that plot. The baseline truncates 
phenotypes with low vigor, setting their fitnesses to zero. 




X 

Phenotype, x 

FIG. 3. Novel environmental challenge or intense competi- 
tion favors a broad reaction norm. In this example, both the 
broad and narrow reaction norms are centered at x. Pheno- 
types above the truncation point survive. Phenotypes below 
the truncation point die. None of the phenotypes for the nar- 
row reaction norm are above the truncation point, so all die. 
Some of the phenotypes of the broad reaction norm survive. 
Those surviving phenotypes may evolve so that their average 
phenotype, x, moves toward the truncation point, improv- 
ing fitness over time. Improvement occurs if there is genetic 
variation for the average phenotype, x, of the reaction norm. 

with a single peak (Fig.|4]3). A broad reaction norm 
will typically perform badly near a fitness peak, but allow 
much more rapid evolutionary advance to a higher fitness 
peak. Once again, we see that broad reaction norms ex- 
ploit current fitness opportunities relatively poorly but 
gain by enhanced exploration and achievement of novel 
adaptations. 



reaction norm to increase the chance of matching what- 
ever is favored at any time [2 . Here, I focus on constant 
challenges to extreme or novel environments. 



D. Smoothly increasing fitness path in a multipeak 
fitness landscape 

Much discussion in evolutionary theory concerns how 
populations shift from a lower fitness peak to a higher 
fitness peak [13 . For example, in the fitness landscape, 
/(x), of Fig. |4]3, a population starting on a lower peak 
must evolve through a valley of lower fitness in order to 
follow an increasing path to a higher fitness peak. Natu- 
ral selection typically follows a path of increasing fitness, 
so a population may be trapped on a lower peak. 

Most evolutionary analyses use a fitness landscape that 
relates phenotype, x, to fitness, f{x). However, the 
proper measure should relate the average phenotype of a 
genotype, x, to the expected fitness, F{x) [2]. 

A sufficiently broad reaction norm smooths a multi- 
peak fitness landscape, /(x), into a smooth landscape. 



III. DIMENSIONALITY AND DISCOVERY 

The reaction norm may be generated randomly by per- 
turbations in development. If so, then exploration of the 
fitness landscape by a broad reaction norm is a type of 
random search. Figs. |3] and |4] show that random search 
can greatly increase the rate of adaptation, particularly 
to novel environmental challenges. 

Those previous examples showed the reaction norm 
and fitness both varying across a single dimension. A 
broad reaction norm spreads phenotypes along that sin- 
gle dimension, increasing the chance that some individu- 
als will have high fitness. 

Now consider the much more difficult search problem 
that arises in higher dimensions [14]. Suppose, for ex- 
ample, that adapting to a novel environmental challenge 
requires multiple phenotypic changes to work together in 
a harmonious way. Think of each particular phenotypic 
change as a trait in its own dimension, so that the search 
now occurs in multiple dimensions. If the reaction norm 
simply generates random phenotypes in each dimension, 
then there is little chance of getting simultaneous match- 
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FIG. 4. A broad reaction norm smooths a multipeak fitness 
landscape, (a) The dashed curve shows the broader reaction 
norm, (b) The fitness landscape for each particular 

phenotype, /(x), has multiple peaks, (c) The broad reaction 
norm smooths the fitness landscape to a single peak for the 
relation between the average phenotype for a genotype, x, 
and fitness, F(x). In this example, the narrow and broad 
reaction norms follow A/'(0,7^) distributions with variances 
of 0.04 and 0.16, respectively. Fitness is given by f(x) — 
Ej=-i(3|l + + l)A/'(i,cr2), with cr^ = 0.0225. The value 
of F{x) is calculated from Eq. ([2]), yielding the expression 
for j(x) in the prior sentence with the variance replaced by 
cr^ +7^. The baseline truncates small values. 



ing phenotypes in multiple dimensions. 

To visualize the multidimensional problem, begin with 
the one-dimensional fitness landscape in Fig. |4]3. Now 
consider two phenotypic dimensions. Assume that fit- 
ness concentrates along one dimension, as in Fig. [5^. In 
that plot, only a narrow band of phenotypes along the 
second phenotypic dimension produces viable individu- 
als. In the first dimension, fitness rises and falls along 
the same peaks and valleys as in Fig. |4]3. Thus, both 
figures show essentially the same fitness landscape, but 
in the second case the nearly one-dimensional landscape 
is embedded in a second dimension (fitnesses scale loga- 
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Phenotype 

FIG. 5. A broad reaction norm performs poorly when fit- 
ness is concentrated in a lower dimension, (a) The bivariate 
analogy of the fitness landscape in Fig. [4}), scaled logarith- 
mically. The primary dimension has variance (j\ — 0.0225 
corresponding to standard deviation g\ — 0.15, as in Fig.|4]D. 
The secondary (narrow) dimension has standard deviation 
(72 = O.lcTi. (b) Fitness landscape smoothed by a reaction 
norm concentrated in the same dimension as fitness. The 
variance of the reaction norm in the primary dimension is 
7i = 0.16, and standard deviation is 71 = 0.4, as in the 
dashed reaction norm of Fig. [4^. The standard deviation in 
the secondary dimension is 72 = O.OI71. The smoothed fit- 
ness surface rises steadily to a peak along its ridge in the pri- 
mary dimension, tracing the same path as the dashed curve 
in Fig. [4]:. (c and d) Increasingly broad reaction norms in the 
secondary dimension with standard deviations of O.I71 and 
71, respectively. The baseline truncates small fitness values, 
which are considered inviable. 



rithmically in Fig. [5| . 

In two dimensions, the reaction norm will smooth phe- 
notypes along both trait axes. When the reaction norm 
varies mostly along the same dimension as the variation 
in fitness, as in Fig.jS}), then we obtain the same smooth- 
ing as in one dimension (dashed curve of Fig.|4]3). When 
the reaction norm varies in both directions, as in Fig.[5]i, 
then the smoothed surface has very low fitness even at its 
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FIG. 6. Decline in fitness with an increasingly broad reac- 
tion norm away from the primary concentration of fitness. 
The dashed lines show the fitness associated with the high, 
medium and low peaks of the fitness landscape in Fig. [5^. The 
solid curve shows the highest point of the fitness functions 
smoothed by the reaction norms of Fig. |5]3-d, with standard 
deviation of the reaction norm increasing in the secondary di- 
mension. The secondary dimension standard deviation value 
shown in the plot gives the amount by which the primary 
dimension standard deviation is multiplied in the second di- 
mension. The multipliers 0.01, 0.1, 1.0 correspond to the three 
smoothed fitness surfaces in Fig. [5]3-d. When the secondary 
dimension is narrow, for example, reduced in width by a fac- 
tor 0.01, then the smoothed fitness peak is higher than the 
intermediate fitness peak of the unsmoothed landscape, as in 
Fig. [4]:. As width in the secondary dimension increases, the 
cost of exploring in a dimension away from the concentration 
of fitness causes the peak of the smoothed fitness landscape 
to drop very low, illustrating the very high cost of exploring 
in more dimensions than the concentration of fitness. 

peak. The low fitness occurs because the randomly gen- 
erated reaction norm produces phenotypes spread across 
two dimensions. Most of those phenotypes fall off of the 
one dimensional concentration of fit phenotypes. 

In general, when the dimensionality of the reaction 
norm exceeds the dimensionality of the fitness concen- 
tration, then a random search process is inefficient. The 
cost of exploration is so high that even the best average 
phenotype for a genotype has fitness, lower than 

the lowest peak of the fitness landscape, f{x). Here, x 
and X represent multidimensional phenotypes. Fig. [6] il- 
lustrates the cost of exploration in relation to the spread 
across dimensions. 

In summary, if the space of possible trait combinations 
spreads over greater dimensions than the concentration 
of fitness, then randomly generated variations will pro- 
duce mostly worthless variants. The search cost is high, 
and average performance for a widely spread reaction 
norm is low. The smoothed fitness surface may have a 
steadily rising path to its fitness maximum from many 
initial points, but the height of the fitness peak is so low 
that a broad reaction norm will often be strongly selected 
against. 

The following sections describe two processes that may 
offset the high cost of developmental variation. First, the 
broad search space may be covered by genetic variants. 



Box 3. Recent literature 



Ancel analyzed smoothed fitness surfaces and the con- 
sequences for evolutionary rate. She emphasized three impor- 
tant points. 

First, learning accelerates evolution only under certain 
conditions. The examples in the text illustrate this point by 
showing that learning mainly accelerates evolution through 
discovery of viable phenotypes or in the smoothing of a mul- 
tipeaked fitness surface. Otherwise, the smoothing of fitness 
surfaces may lower the maximum fitness that can be attained, 
reducing the slope and the evolutionary rate to the peak. 

Second, although learning may accelerate evolution, it is 
not necessarily true that learning evolved because it accel- 
erates evolution. The evolutionary consequence of a trait is 
distinct from whether or not the trait evolved because of its 
potential to alter subsequent evolutionary dynamics. The lit- 
erature discusses this distinction under the topic of evolv- 
ability. Evolvability has developed into a large subject of its 
own 16-20 . Holland's [12 distinction between exploration 
and exploitation captures aspects of the later developments 
on evolvability. 

Third, Ancel noted historical precedents for the idea that 
phenotypic variance may eliminate otherwise uncrossable val- 
leys in fitness landscapes 21-23 . 

A large literature develops issues related to Ancel's three 
points and the broader problems of how reaction norms affect 
fitness surfaces. I list a small sample [141 124f[42] . 

West-Eberhard discusses many empirical issues and ex- 
amples. Recent studies in evolutionary biology provide new 
data or summaries of the literature |43|j44j. Articles in micro- 
biology and cancer research have also developed the relation 
between nonheritable phenotypic variation and evolutionary 
process [451453] . 

In the text, I discuss the synergism between genetic and 
developmental variation. I am not aware of literature related 
to that issue. However, given the many papers on the general 
topic, the synergism between genetics and development may 
have come up previously. 



with developmental variation searching only the local re- 
gions around each genotypic variant. 

Second, developmental variation may be biased in a 
way that tends to match the environment. If a develop- 
mental or learning process brings the phenotype close to 
the concentration of fitness in a high dimensional space, 
then some additional random variation can greatly in- 
crease the rate of adaptation. In this case, the fitness 
surface is smoothed to provide a steady path of increas- 
ing fitness, and the developmental bias that brings the 
center of the phenotypic distribution close to the fitness 
concentration mitigates the large cost of search in high 
dimensional phenotypic spaces. 

IV. SYNERGISM BETWEEN PHENOTYPIC 
AND GENETIC VARIATION 

A broad reaction norm may enhance survival and sub- 
sequent opportunity for improved fitness. But those ben- 



7 



efits arise only when a genotype is sufficiently close to a 
fitness peak. Fig. [7| illustrates the problem. 

Fig. [7^ shows the fitness peak in a novel environment. 
The dots show the locations of alternative genotypes, 
placed by their average phenotypes in two dimensions. 
Neither genotype has positive fitness. Both will die out. 
In that plot, the fitness peak is the direct fitness land- 
scape, unsmoothed by a reaction norm. In Fig. [7]3, the 
reaction norm is relatively narrow, smoothing the fitness 
landscape. But that smoothing is not enough to place ei- 
ther genotype on the nonzero fitness surface. Both geno- 
types still die out. 

The broader reaction norm in Fig. ^ smooths the fit- 
ness surface more widely. That additional smoothing al- 
lows the nearby genotype to survive. Subsequent small 
genetic variations would allow natural selection to drive 
the surviving population up the path of increasing fitness 
to the fitness peak. 

The distant (red) genotype cannot survive even with 
the broad reaction norm of Fig.jT]^. The contrast between 
the nearby and distant genotypes emphasizes a key point. 
A genotype must be sufficiently close to the nonzero part 
of the smoothed fitness surface in order for the develop- 
mental variation of the reaction norm to allow survival — 
the touching of the fitness surface. If a genotype touches 
the fitness surface, then it can seed a population in which 
small genetic variations allow subsequent adaptation by 
climbing the surface to the peak. 

In a high dimensional space, any single genotype is un- 
likely to be located sufficiently close to a fitness peak after 
a significant change in the environment or in response to 
an unpredictable challenge. Synergism between genetic 
variation and the phenotypic variation of reaction norms 
provides one solution to this search problem. 

Fig. |8] illustrates synergism between genetic and phe- 
notypic variation. The dots represent different geno- 
types. Each genotype has a different combination of av- 
erage phenotypic values in two dimensions. The array of 
dots shows the genetic diversity in the population. The 
smoothed fitness surface has the same fitness peak and 
reaction norm shape as in Fig. [T]^. In Fig. [Sj the location 
of the fitness surface varies in the different plots, illus- 
trating different environmental challenges. No matter 
where the newly favored fitness surface arises upon envi- 
ronmental challenge, the genetic diversity in the popula- 
tion provides at least one genotype on the nonzero part 
of the novel fitness surface. Those genotypes on the sur- 
face can survive the novel challenge. Subsequent small 
genetic variations around a surviving genotype allow the 
population to evolve up the fitness surface to the peak 
set by the novel environmental challenge. 

Synergism between genetic and phenotypic variation 
divides the adaptive search problem into three parts. Ge- 
netic variation covers widely separated locations in the 
phenotype space. Reaction norms cover the phenotype 
space around each genotype. Any genotype on a nonzero 
part of a novel fitness surface can survive and subse- 
quently adapt by small genetic variations and natural 



Box 4. Vertebrate immunity 



Invading pathogens present a vast diversity of foreign 
molecules that must be recognized. The vertebrate adaptive 
immune system develops antibodies by synergism between 
phenotypic and genetic variation, following the general three- 
part search process described in the text [54l [55] . 

First, to generate genetic diversity, B cell lineages within 
the body undergo programmed genetic recombination early 
in life. That recombination yields genetically distinct cellular 
clones. Each clone produces a distinct antibody. 

Second, each antibody type from this initial diversity 
tends to bind relatively weakly to a variety of foreign antigens. 
In this regard, the original or "natural" antibodies trade the 
cost of weak binding for the benefit of a phenotypically di- 
verse response — a broad reaction norm. Upon challenge with 
a foreign antigen, those B cells with matching antibodies are 
stimulated to expand clonally. That clonal expansion can be 
thought of as survival and reproduction of those genotypes 
that land on the fitness surface imposed by the unpredictable 
invader. 

Third, the weakly binding antibodies undergo a pro- 
grammed round of hypermutation to the antibody binding 
site and selection favoring variants that bind more tightly 
to the foreign antigen. This affinity maturation produces 
tightly binding and highly adapted antibodies in response to 
the novel challenge. Put another way, the initially stimulated 
antibodies on the edge of the "fitness surface" climb the sur- 
face toward the fitness peak. 

In the process of climbing the fitness peak by local genetic 
variation and natural selection, the refined antibodies match 
more closely to the environmental challenge. In particular, 
the refined antibodies narrow their reaction norm by increas- 
ing their binding affinity for close matches and reducing their 
binding affinity for slightly mismatched binding. 

In summary, the ability of the adaptive immune system to 
respond to the huge diversity of potential challenges depends 
on its synergism between genetic variability and the reaction 
norm. The initial natural antibodies arise from genetically 
diverse clones produced by recombination. That genetic di- 
versity by itself could not cover the huge space of possible 
challenges. The broad reaction norm around each genetic 
variant allows protection against novel challenge. Once par- 
tial recognition is achieved through the natural antibodies, 
the system refines the match locally by affinity maturation. 



selection. See Box [4] for an example of the synergism 
between genetic and nonheritable phenotypic variation. 



V. MATCHING THE ENVIRONMENT BY 
PLASTICITY OR LEARNING 

To survive a novel environmental challenge, a pheno- 
type must be near the nonzero part of the new fitness 
landscape. A population may survive by having a vari- 
ety of genotypes that produce different phenotypes, in- 
creasing the chance that at least one of the phenotypes 
will be close to a new fitness peak. Alternatively, a sin- 
gle genotype may be able to produce diverse phenotypes 
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FIG. 7. Fitness in two phenotypic dimensions after challenge by a novel or extreme environment. The peak corresponds to 
the favored phenotype after environmental challenge. The black dot shows the average phenotype for a genotype near the new 
fitness peak. The red dot shows the average phenotype for a genotype relatively far from the new peak. A sufficiently broad 
reaction norm allows the nearby genotype to survive, providing an opportunity for natural selection to drive the population 
up the smoothly increasing path to the new fitness peak. By contrast, the distant genotype cannot survive the environmental 
challenge, (a) The fitness landscape, /(x), showing the direct relation between phenotype and fitness when not smoothed by a 
reaction norm, (b) A relatively narrow reaction norm smooths the fitness peak, F{x), but not sufficiently to allow the nearby 
genotype to survive, (c) A broader reaction norm allows the nearby genotype to survive, with subsequent opportunity for 
natural selection to drive the population to the peak, (d) An increasingly broad reaction norm causes the smoothed fitness 
peak to sink mostly below the fitness truncation level, so that the nearby genotype cannot survive. All plots show a bivariate 
normal fitness surface with mean (1/2, 1/2) and variance +7^, with = 0.01 for the fitness landscape, and 7^ for the 
reaction norm of 0,0.01,0.07,0.11 for plots left to right. The heights are the natural logarithm of fitness, with a truncation 
base of log (10). The nearby black dot is at (17/40, 17/40), and the far red dot is at (11/40, 11/40). 




Phenotype 

FIG. 8. Synergism between the reaction norm and genetic variation allows rapid adaptation to novel or extreme environments. 
In this case, a population has multiple genotypes, each genotype located at one of the dots. The smoothed fitness surface 
is sufficiently broad that, for any location of the fitness peak after environmental challenge, the fitness surface touches at 
least one of the genotypes. The genotypes that touch the fitness surface survive, allowing the surviving population the 
potential subsequently to evolve up the fitness surface to the new peak. The dots are located at all bivariate pairs from 
{(5 + 6i) /40, (5 + 6j) /40} for i, j = 1, . . . , 4. The fitness surface has variance + 7^ = 0.08, as in Fig. 



by matching phenotypic expression to the particular en- 
vironment. The developmental flexibility to match envi- 
ronments may arise by phenotypic plasticity or learning. 

Plasticity or learning may not be able to match exactly 
a novel or extreme environmental challenge. But if a 
developmental response to the environment can move the 
phenotype sufficiently close to the nonzero part of the 
new fitness landscape, then the genotype may survive and 
subsequently adapt [U |4l |6] . Developmental flexibility is 
simply another process that alters the shape of the fitness 
surface. 

The adaptive search problem has three phases, simi- 
lar to the three aspects of search described in the prior 



section. First, partially matching expression to the en- 
vironment brings the phenotype close to the new fitness 
landscape. Second, random perturbations of phenotype 
occur around the location set by the process of environ- 
mental matching. Third, any genotype on a novel fitness 
surface can survive and subsequently adapt by small ge- 
netic variations and natural selection. 

Fig. [9] illustrates the three aspects of adaptive search. 
Suppose a genotype expresses phenotypes centered at x. 
In the first aspect of adaptive search, a genotype can 
adjust phenotypic expression to match the environment. 
The possible range of phenotypes varies from x—c to x-\-c. 
The phenotype expressed by environmental matching is 
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FIG. 9. Plasticity or learning provides a partial match of phe- 
notype to novel or extreme environmental challenge. A geno- 
type's default expression has average phenotypic value x. An 
individual can modify average trait expression in response to 
the environment. The average expressed phenotype can be 
any value in the range x =b c. The environmental challenge de- 
fines the fitness landscape that relates phenotype to fitness, 
here shown as the solid peak. Phenotypic expression, modu- 
lated by a match to the environment, shifts the phenotype to 
x+c. However, the fitness associated with a phenotype of x+c 
is zero, because that value remains outside the range of viable 
phenotypes. Suppose x+c is the average phenotype expressed, 
and random perturbations of expression cause variability in 
phenotype around that average value. The random compo- 
nent of phenotype smooths the fitness landscape, leading to 
the dashed fitness surface. The expressed average phenotype, 
x + c, now falls within the smoothed fitness surface, allowing 
the genotype to survive. Subsequent adaptation may allow 
improved fitness, by altering the range of phenotypes that 
can be expressed so that a match to the fitness peak may be 
achieved. 



the new average value, around which random perturba- 
tions may occur. In the figure, the solid peak shows the 
fitness landscape imposed by a novel or extreme environ- 
mental challenge. The example genotype can come close 
to the new peak by modulating expression to produce an 
average phenotype of x + c. However, if no random vari- 
ation occurs around x + c, that phenotype falls outside 
the range of phenotypic values that can survive. 

In the second aspect of adaptive search, the genotype 
may produce phenotypes randomly distributed around 
the mean value of x + c. Those random fluctuations 
smooth the fitness landscape, shown by the dashed curve. 
The average phenotype x + c can now survive. Matching 
the environment allowed expression of mean phenotype 
x + c, and random fluctuations in phenotype smoothed 
the nearby landscape sufficiently. 

Once the genotype achieves survival, the third phase 
of adaptation may proceed. In this case, the mean phe- 
notype x + c has low fitness on the dashed fitness surface. 
But the fitness surface has a smoothly increasing path to 
the peak of maximum fitness. Genetic variations in the 
genotype may shift the range of phenotypes that can be 
produced, allowing natural selection to drive the popula- 
tion up the fitness surface to the peak. 



Box 5. A simple prediction 



Frequent exposure to novel or extreme environmental 
challenge favors a greater capacity for evolutionary response. 
Faster evolutionary response may be achieved by enhanced 
generation of genetic variability. For example, in experimen- 
tal evolution studies, strong competition favors strains of bac- 
teria with a high mutation rate [56 . Strong competition can 
be thought of as a form of extreme challenge. However, inter- 
ference between genetically distinct competing clones compli- 
cates the association between mutation rate and evolutionary 
rate [57 . 

Faster evolutionary response may also be achieved by en- 
hanced phenotypic variability through learning or plasticity. 
However, it may be difficult to test empirically whether learn- 
ing or plasticity are direct responses to increased environmen- 
tal challenge. 

The problem is that learning and plasticity tend to be 
complex adaptations. To build those complex adaptations 
takes a long time and many changes. That slow process means 
that one would have to measure the change in evolutionary 
pressure over the long period during which learning or plas- 
ticity evolve. Ultimately, one would need direct measures of 
environmental pressure in periods of weak challenge and in 
periods of strong challenge, and measures of the time course 
of change in learning or plasticity. That is hard to do. 

A simpler prediction concerns the rapid increase in the 
reaction norm in response to an environmental challenge. 
If a population encounters a novel or extreme environmen- 
tal challenge, it may respond by broadening its reaction 
norm. A broader reaction norm can be achieved simply by 
increasing the tendency for stochastic perturbations during 
development — a breakdown in the normal homeostatic pro- 
cesses that keep phenotypes within narrow bounds. This pre- 
diction about the broadening of the reaction norm follows im- 
mediately from Holland's [12 contrast between exploitation 
and exploration and the quote given in the text by Haldane 
[11] . Many specific models analyze how the reaction norm re- 
sponds to environmental challenge. Lande [39^ provided new 
models and a good overview of the literature. 

In this case, one simply needs to measure the change in 
environmental pressure over short periods of time and the 
associated change in the reaction norm of the population. It 
would also be interesting to measure how a broader reaction 
norm acts synergistically with genetic variation to speed the 
evolutionary response. 



VI. CONCLUSION 

Evolutionary theory emphasizes genetic variation as 
the source of evolutionary novelty. By the standard the- 
ory, the usual sequence would be a novel environmen- 
tal challenge, genetic variation either already present or 
arising de novo, and evolutionary response to the novel 
environment by change in gene frequency. 

In this classical evolutionary theory, genetics provides 
the source of phenotypic variation on which natural se- 
lection acts. By contrast, development may generate the 
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phenotypic novelty that initiates adaptation to environ- 
mental challenge. The sequence would be novel envi- 
ronmental challenge, initial survival by those individu- 
als with a phenotypic norm of reaction that overlaps the 
new fitness surface, and subsequent adaptation by ge- 
netic variants from those phenotypes that survive the 
initial challenge. 

West-Eberhard [1 traces the theoretical foundations of 
this topic from the late 19th century. Since that time, the 
idea that developmental processes may play a key role in 
initiating adaptation has never been popular. Evolution- 
ary change is usually tied in thought to genetic change. 
Nonheritable phenotypic variation by itself is therefore 
usually believed not to accelerate evolutionary rate. 

The original theories of learning, developmental plas- 
ticity, and reaction norms have always understood the re- 
lations between genotype, phenotype, environment, and 
evolutionary change. However, the jargon from those the- 
ories is thick: the Baldwin effect, genetic assimilation, re- 
action norms, hopeful monsters, niche construction, and 
environmentally induced evolution. Each variant theory 
invoked special environmental conditions, developmental 
processes, and interactions with genetics. And each in 
its own way jousted with the ghost of Lamarck. A casual 
observer could be forgiven for steering clear of the whole 
mess. Wisdom suggested to wait for clear empirical ex- 
amples. Induction still dominates mainstream thought 
in biology. 



Many years ago, I read Hinton and Nowlan's [8^ article 
and Maynard Smith's [58^ related essay on the Baldwin 
effect. They focused on the essential theoretical point. 
Learning smooths the fitness surface, changing evolution- 
ary dynamics in a way that greatly accelerates adaptation 
to novel or extreme environmental challenges. When one 
views the whole confusing field in that simple light, one 
sees that all the complexities of the theories and mecha- 
nistic details of phenotypic variability ultimately reduce 
to the same point. Developmental variation smooths the 
fitness landscape. A smoothed fitness landscape pro- 
foundly alters evolutionary dynamics, particularly in re- 
sponse to novel or extreme environmental challenge. 
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